Rede BIOFOCO: A distributed computation of Interpro Pfam, PROSITE and ProDom for protein annotation

نویسندگان

  • Edward de Oliveira Ribeiro
  • Gustavo G. Zerlotini
  • Irving Lopes
  • Victor Ribeiro
  • Alba Cristina Magalhaes Alves de Melo
  • Maria Emilia Telles Walter
  • Marcos Motta
چکیده

Interpro is a widely used tool for protein annotation in genome sequencing projects, demanding a large amount of computation and representing a huge time-consuming step. We present a strategy to execute programs using databases Pfam, PROSITE and ProDom of Interpro in a distributed environment using a Java-based messaging system. We developed a two-layer scheduling architecture of the distributed infrastructure. Then, we made experiments and analyzed the results. Our distributed system gave much better results than Interpro Pfam, PROSITE and ProDom running in a centralized platform. This approach seems to be appropriate and promising for highly demanding computational tools used for biological applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NetAffx GPCR annotation database summary

Only approximately 51% of the human proteome can be annotated by the standard motif-based recognition systems [1]. These systems, currently aggregated into a single distributed system by InterPro [2], include PFAM, PRINTS, ProSite, ProDom, SMART, and SWIS-PROT+TrEMBL. PFAM consists of hidden Markov models based on hand-curated alignments of protein domains. PRINTS is a repository of protein fin...

متن کامل

The InterPro database, an integrated documentation resource for protein families, domains and functional sites

Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, liter...

متن کامل

Reference InterPro , progress and status in 2005 MULDER , Nicola

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is pro...

متن کامل

The InterPro BioMart: federated query and web service access to the InterPro Resource

The InterPro BioMart provides users with query-optimized access to predictions of family classification, protein domains and functional sites, based on a broad spectrum of integrated computational models ('signatures') that are generated by the InterPro member databases: Gene3D, HAMAP, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. These predictions are provided...

متن کامل

InterProScan - an integration platform for the signature-recognition methods in InterPro

UNLABELLED InterProScan is a tool that scans given protein sequences against the protein signatures of the InterPro member databases, currently--PROSITE, PRINTS, Pfam, ProDom and SMART. The number of signature databases and their associated scanning tools as well as the further refinement procedures make the problem complex. InterProScan is designed to be a scalable and extensible system with a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genetics and molecular research : GMR

دوره 4 3  شماره 

صفحات  -

تاریخ انتشار 2004